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PROBABILITY AND ITS APPLICATION TO ANTISUBMARINE WARFARE 


Introduction 


One of the most important characteristics of modern ASW (Antisubmarine 
Warfare) is the existence of so many uncertainties and unknowns. Examples 
of these "unknowns" abound in almost every aspect of this warfare area. 

To a Tactical Coordinator on a search aircraft the "unknown" may be the 
evasive maneuvers an enemy submarine skipper will employ, how many of the 
sonobuoys he has aboard his aircraft are defective, or which portion of 

a search region offers the best prospects for a contact. To a CIC officer 
the presence of a submarine within detection range may be unknown because 
of excessive ambient noise; at the time a contact is gained it may be 
unknown to him whether the contact is in fact a submarine, and then the 
target location accuracy may be unknown because of uncertainties in 
underwater sound propagation. 

Because of the existence of so many unknowns, many Naval Officers who 
work in ASW are tasked with making action recommendations and action 
decisions, which are based on predictions of the unknown in the light of 
all that is known. Therefore, success in the overall ASW effort is 
directly related to the capability within the ASW-related officer corps 
in two key areas: (i) making predictions concerning the unknown, and 
(ii) understanding the concomitant degree of eeanacy and degree of 
uncertainty of such predictions, so that the best possible decisions may 
then be made. 

Probability theory provides a logical and scientific basis for predic- 


tion and decision making. It is not surprising, therefore, that 





probabilistic concepts and the language of probability are in widespread 
everyday usage in the ASW community today. Even the basic references 
used in ASW, such as ComTac publications, ASW handbooks, Tactical 
Reference manuals, TacAids, ASRAPS and AESPAWS, use the language and 
concepts of probability theory freely. 

Because of the widespread use of probability in the tasks assigned 
to the ASW-related Officer, his effectiveness should be enhanced by an 
increased understanding of probability theory and a better appreciation 
of its benefits in ASW. This report constitutes a "tutorial paper" 
designed to provide a means by which ASW-related officers may better 
understand application of probability in modern ASW. The objective of 
this project has been to distill from the general field of probability 
those elements that are vital in ASW applications, and to describe these 
elements, along with tvpical examples of their application, in an 
organized and readable presentation that is accessible to any ASW-related 
Officer, with the only prerequisites being basic algebra and geometry 
as applied to tactical situations in his working environment. 

This paper has been organized into the format of an academic course, 
including lesson plans, study guides and suggested textbooks. This 
organization was adopted because it is expected that this paper will 
serve as the basis for a future short course that will be available to 
the Fleet through the Continuing Education Office at the Naval Postgraduate 
School. Because of the possible misunderstanding that may result from 
the "course format" adopted here, it is emphasized that the content of 
this paper does not constitute a Naval Postgraduate School course. It is 
intended that an official course be developed from these notes; this, 


however, involves an evolutionary process, including testing of the 


materials with actual students, and subsequent revisions. 
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The Lesson Plans provide the objectives of each unit; they indicate 
what is to be learned, the depth required, and the behavioral goals. The 


Study Guides serve as the basic textbook. In addition, Schaum's Outline 


of Theory and Problems of Probability by S. Lipschutz (1965) and Schaum's 


Outline of Theory and Problems of Statistics by M. R. Spiegel (1961), 


both by the McGraw-Hill Book Company, are very helpful; these texts 
are inexpensive and easy to follow, as they contain many samples with 
detailed solutions. Reference is made to the appropriate sections in 
the Schaum's texts at the end of each chapter. 

The primary material presented is the probability theory required 
for successful ASW application, and examples to illustrate the applica- 
tion. The probability theory is preceded by a presentation of sufficient 
Statistical procedures to serve as "first principles" on which the 


probability is based. 





Lesson Plan No. S-V-l ins ERUCEOL - 


Time: 65 min. 


Date: 


Subject Area: Probability and Its 
Applications to Anti- 
= Submarine Warfare 
Topic: Data Reduction, Frequency 
Distribution 


Reference: ASW Study Guide; Schaum's 
Outline Series, Theory 
and Problems of Statistics 


Title: "Sample Data Reduction and Frequency Distributions" 


Materials: Transparencies of Table 1, Table 2, Table 3, Figure l, 
Figure 2, and Figure 3 


Objectives: (1) 


(2) 


Acquaint the student with the concepts of statistical 
population and samples from the population; the 
student should be able to explain these concepts by 
means of simple examples. 


Discuss the basic terms required to describe frequency 
distributions, and show mathematically how they are 
utilized. The student should be able to determine the 
frequency distribution from a given set of raw data. 


Introduction: Instructor should inform students of purpose of lesson 
and briefly highlight important points to be covered. 


Presentation: Discuss the following points: 


1s 


Ze 


Population 
Sample 


Methods of describing a sample and estimating 
characteristics of the population 


Raw data 

Arrays 

Classes or categories 

Class frequency and frequency table 
Class intervals and class limits 
Class boundaries 


SUZeron width Of a class interval 





il. 


2s 


1 


14. 


i 


Ge 


ies 


Class mark 

General rules for forming frequency distributions: 
a. The range 

b. Number of class intervals 


c. Number of observations falling into each class 
interval 


_ Histograms and frequency polygons 


Relative frequency distributions 

Cumulative frequency distributions or ogives 
Relative cumulative frequency distributions 
Types of frequency curves 

a. Symmetrical or bell-shaped 

b. Skewed to the right 

Gameekewed to the ert 

d. U-shaped 

e. Bimodal 


£. Multimodal 


Summary: Review material covered, reemphasizing key points. Ask 
questions to determine effectiveness of instruction. 
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Lesson Plan No. 


Time: 115 min. 


S-V-2 Instructor: 


Subject Area: Probability and Its 
Applications to Anti- 
submarine Warfare 


Date: Topic: Measures of Central Tendency 
and Dispersion 
Reference: ASW Study Guide; Schaum's 
‘ Otmliinc Serres, Uncory 
and Problems of Statistics 
Title: "Special Measures of Sample Data--Central Tendency and Dispersion" 


Materials: Transparency of Figures 4 through 6 


Objectives: 


Introduction: 


Presentation: 


(1) 


(2) 


Define central tendency and its measurement from sample 
data. Illustrate and develop the sample mean, median, 
and mode. The student shall be able to caiculate the 
mean, median, and mode from given sample data. 


Define dispersion and its measurement from sample data. 
Lllustrate, and develop the listed measures of disper- 
sion so that the student is capable of calculating 

the range, standard deviation and variance from given 
sample data, 


Instructor should inform students of purpose of lesson 
and briefly highlight important points to be covered. 


Discuss the following points: 


Ly 


ae 


Index or subscript notation 
Summation notation 

Measures of central tendency 
a. Mean 


(1) (Arithmetic) Mean 
(2) Weighted (arithmetic) mean 


b. Median 

c. Mode 

The concept of dispersion of data 
Specific measures of dispersion 
a. Range 


b. Standard Deviation 
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(1) Formulation of | 
(2) Short method of computing 


c. Variance 


d. Sigma-values for Normal probability distributions 
with corresponding probability values. 
Review material covered, reemphasizing key points. Ask 


Summary : 
questions to determine effectiveness of instruction. 
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Lesson Plan No. S-V-3 Instructor: 

Time: 20 min. Subject Area: Probability and Les 
Applications to Anti- 
submarine Warfare 

Date: Topic: Combinatorial Analysis 

Reference: ASW Study Guide; Schaum's 
Outline Series, Theory 
and Problems of Statistics 

Title: “Combinatorial Analysis" 

Objective: Develop the concept of COMBINATORIAL ANALYSIS by defining and 
illustrating, with a simple example, the basic elements of 
this analysis. The student should be able to perform 
combinatorial analysis on a given example by utilizing the 
fundamental counting principle, and calculating permutations/ 
combinations to find the number of possible outcomes for 


the example. 


Introduction: Instructor should inform students of purpose of lesson 
and briefly highlight important points to be covered. 


Presentation: Discuss the following points: 
1. The fundamental counting principle 
2. Factorial n 
3. Combinations 
4. Permutations 


Summary: Review material covered, reemphasizing key points. Ask 
questions to determine effectiveness of instruction. 





Lesson Plan No. 


Time: 150 min. 


Date: 


S-V-4 ENStructor: 


Subject Area: Probability and Its 
Applications to Anti- 
submarine Warfare 


Topic: Elementary Probability Theory 


Reference: ASW Study Guide; Schaum's 
Outline Series, Theory 
Andee boo rems Of vwotallSteLics: 
and Schaum's Outline Series, 


Theory and Problems of 
Probabilit 


Title: "Elementary Probability Theory" 


Materials: Transparencies of Figures 5 through 12 


Objectives: (1) 


(2) 


(3) 


Develop student intuition in the basic concepts of 
probability, i.e. his insight into what probability is. 
Develop theconcepts of basic probability through the 

use of the proportional areas notion of probability. 

The student should be able to compute simple probabilities 
given sample data and simple statement of necessary 
conditions. 


Develop the concept of CONDITIONAL PROBABILITY as an 
improved probability resulting from having previously 
eliminated part of the overall sample space to be 
"drawn on," through use of prior information. The 
student should be able to compute conditional 
probabilities utilizing any of the three listed methods. 


Develop the concept of STATISTICAL INDEPENDENCE, and 
delineate between this and MUTUALLY EXCLUSIVE EVENTS. 
Also, show the utility of statistical independence of 
events in computing a simple joint probability. Given 
two events, the student should be able to state 
whether or not they are statistically independent. 


Introduction: The instructor should inform students of purpose of lesson 
and briefly highlight important points to be covered. 


Presentation: Discuss the following points: 


re: 


Elementary Probability Theory 
a. Background and concepts of probability 
(1) "Odds" or ratio concept 


(2) Proportional areas concept 
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g. 


Sample Space 
Probability Experiment 
Outcomes 

Events 

Probability Function 


Probability Complement 


2. Conditional Probability 


a. 


b. 


Concept of 
Methods of computing 
(1) By definition 


(2) "Stating and Solving'"--"Simply writing 
it down" 


(3) Bayes! Law 


3. Statistical Independence 


a. 


be 


C. 


Independent Events; Probability of Independent 
events 


Independent Events vs. Mutually Exclusive Events 


Implications/applications of Statistical Independence 


Summary: Review material covered, reemphasizing key points. Ask 
questions to determine effectiveness of instruction. 
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Lesson Plan No. S-V-5 Instructor: 
mame: LOO min. Subject Area: Probability and Its 


Applications to Anti- 
i submarine Warfare 


Date: Topic: Random Variables 


Reference: ASW Study Guide; Schaum's 
Outline Series, Theory 


and Problems of Probability 


Title: "Random Variables" 

Materials: Transparencies of Table 5, Table 6 and Figures 15 through 19 

Objectives: Introduce the concept of random variables, and develop 
random variables as a device for assigning names to the out- 
comes of a probability experiment. The student should be 
able to demonstrate how a random variable is used by means 


of examples. 


Introduction: Instructor should inform students of purpose of lesson 
and briefly highlight important points to be covered. 


Presentation: Discuss the following points: 

1. Random variable 
a. discrete 
b. continuous 
c. density function 
d. cumulative distribution function (CDF) 
e. mean 
f. variance 
g. standard deviation 


Summary: Review material covered, reemphasizing key points. Ask 
questions to determine effectiveness of instruction. 





Lesson Plan No. 


mime: 320° min. 


Date: 


S-V-6 ins eruc tor: 


Subject Area: Probability and Its 
Applications to Anti- 
submarine Warfare 


Topic: Basic Statistical Distributions 


Reference: ASW Study Guide; Schaum's 
Ouelines series, Theory 
and Problems of Statistics; 
and Schaum's Outline Series, 


Theory and Problems of 
Probability 


fuerte: “Basic Statistical Distributions" 


Materials: Transparencies of Table 7 and Figures 20 through 29 


Objectives: 


Introduction: 


Presentation: 


(1) 
(2) 


(3) 


Present examples of some useful statistical distributions. 


The student should be able to give a simple description 
of each of the distributions presented, and compute 
probabilities for Bernoulli, Binomial, Uniform, Chi- 
Square, Exponential, Poisson and Normal examples 

(with the exception of the Bivariate Normal and Time- 
Varying Normal, which are presented only for student 
appreciation of "state-of-the-art" applications}. 


It is intended that this lesson serve to extend the 
student's understanding of probability functions, 
random variables, and density functions and cumulative 
density functions. 


Instructor should inform students of purpose of lesson 
and briefly highlight important points to be covered. 


Discuss the following points: 


iE 


Background on the basics of. specifying a statistical 
distribution, drawing the majority of this information 
directly from the previous lesson plan. 

Bernoulli Distribution 

Binomial Distribution 

Unitorm Distribution 

Normal Distributions 


a. Normal Distribution 


b. Bivariate Normal Distribution 
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ec. Time-Varying Normal Distribution 
6. Chi-Square Distribution 
7. Exponential Distribution 


8. Poisson Distribution 


Summary: Review material covered, reemphasizing key points. Ask 
questions to determine effectiveness of instruction. 
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STUDY GUIDE 


PROBABILITY AND ITS APPLICATIONS 


TO ANTISUBMARINE WARFARE 





INTRODUCTION 

Probability theory originated in the early seventeenth century with 
investigations of various games of chance. Since then many scholars 
have made contributions to the theory. Probability theory was finally 
axiomatized in the twenties and thirties of this century. This axiomatic 
development, called Modern Probability Theory, provided for a precise 
formulation of probabilistic concepts, which enabled the theory to be 
placed on a firm mathematical foundation. 

The importance of probability has increased significantly in the 
past twenty years. Today the notions of probability, and the closely 
related subject statistics, appear in almost every discipline. Probability 
is used routinely by many Naval Officers in the line of duty, and plays 
an important role in Antisubmarine Warfare, especially in the area of 
search and detection. 

This study guide is designed for an introductory course in probability 
and applications to antisubmarine warfare, with basic algebra as the only 
prerequisite. The guide may serve as a text for such a course,as lesson 
plans are also provided, or aS a Supplement. In addition, since this 
Study guide is complete and self-contained, it could be used for self- 
Study. 

Chapters I, II, and III of the study guide cover sample data Pee eet: 
frequency distributions, associated measures of central tendency and 
disperison, and combinatorial analysis. This leads naturally to 
elementary probability theory, conditional probability, statistical 


independence and applications, in Chapter IV. Chapter V covers random 





variables. Discrete and continuous random variables are covered, with 
emphasis on the use of tables to determine numerical probabilities. 
Chapter VI covers the Bernoulli, Binomial, Uniform, Normal, Chi-square, 
Exponential and Poisson distributions, which are commonly used in 
antisubmarine warfare applications. Since developing understanding of. 
the applications of these distributions to ASW is the primary "payoff" 
for this course, Chapter VI is significantly longer and has more 
applications than the first five chapters. 

Each chapter has clear statements of pertinent definitions and 
principles, together with illustrations and other descriptive material. 
This is followed by solved problems, which serve to illustrate and 
amplify the theory, bring into sharp focus those fine points without 
which the student feels himself on unsafe ground, and provide the repeti- 


tion of basic principles so vital to effective learning. 





CHAPTER L: FREQUENCY DISTRIBUTION 


POPULATION AND SAMPLE 


In collecting data on characteristics of a group of individuals 
or objects, such as heights of students in a university, or the number 
of defective and non-defective sonobuoys produced in a factory on a 
given day, it is often impossible or impractical to observe the entire 
Sreup, especially if it is large. Instead of examining the entire 
group, called the population, one examines a small part of the group 
called a sample. If a sample is representative of a population, important 
conclusions about the population can be reached from analysis of the 
sample. Inferences based on sample data are not absolutely certain 
because in practice it is usually not known beforehand whether the 
sample is indeed representative of the population, so the term probability 
is used in describing conclusions. The term statistics is used in 


connection with conditions under which such an inference is made. 


RAW DATA 


Raw data are collected data which have not been organized numerically. 
An example is the set of heights of 100 students obtained from Naval 
Academy records. The heights of the 100 students, with repetitions 
included, are raw data which constitute a sample, and the population 
from which the sample is taken is the set of heights of the students 


from the entire listing of the Naval Academy records. 





ARRAYS 


An array is an arrangement of raw numerical data in ascending or 


descending order of magnitude. 


CLASSES OR CATEGORIES, CLASS FREQUENCY AND FREQUENCY TABLE 


In order to comprehend what masses of collected raw data really 
mean, and to draw conclusions from these data, it is useful to distribute 
the data into classes or categories. The number of individual data 
points belonging to a class is called the class frequency. A tabular 
arrangement of data by classes, together with the corresponding class 
frequencies, is called a frequency table. Table l is the frequency table 
of the heights (recorded to the nearest inch) of the 100 Naval Academy 


students. 


Number of 
Seucenie 





Lote hk) 100 


feele 1; Heiechts of 100 Naval Academy Stucents 





CLASS INTERVALS AND CLASS LIMITS 


The symbol defining a class, such as 60 - 62 in Table 1, is called a 
class interval. The end numbers, 60 and 62, are called. class limits; 
the smaller number 60 is the lower class limit and the larger number 62 
is the upper class limit. A class interval which has no upper, or no 
ever. or neither an upper nor a lower class limit is called an open 
class interval. For example, an open class interval is appropriate for 


describing torpedoeswith a range of "10,000 yards or greater." 


CLASS BOUNDARIES 


in table I the heights are recorded to the nearest inch, and the class 
interval 60 - 62 includes all measurements from 59.5000 EQ702 5000 
inches. These numbers, indicated briefly by the exact numbers 59.5 and 


62.5, are called class boundaries; the smaller number 59.5 is the lower 


class boundary and the larger number 62.5 is the upper class boundary. 


SIZE OR WIDTH OF A CIASS INTERVAL 


The size or width of a class interval is the difference between the 
lower and upper class boundaries and is referred to as the class width, 
class size, or class length. For the data in Table 1, for example, the 


elass width is 62.5 - 59.5 = 65.5 - 62.5 = 74.5 - 71.5 = 3. 


CLASS MARK 


The class mark is the midpoint of the class interval; it is obtained 
by adding the lower and upper class limits and dividing by two. Thus the 


class mark of the interval 60 - 62 is (60 + 62)/2 = 61. The class mark 


is also called the class midpoint. 





GENERAL RULES FOR FORMING FREQUENCY DISTRIBUTIONS 


1. Determine the largest and smallest numbers in the raw data 
and thus find the range (the length of the interval from 
the smallest to the largest number). 

2. Divide the range into a convenient number of class intervals 
having the same size. The number of class intervals is 
usually taken between 5 and 20, depending on the data. When- 
ever possible, class intervals are also chosen so that the 
class marks or midpoints coincide with actually observed data. 
However, class boundaries usually do not coincide with 
actually observed data. 

3. Determine the number of observations falling into each class 
interval; i.e., find the class frequencies. As an example of 
a frequency distribution, consider the wages in dollars per week 
paid to enlisted personnel aboard a particular destroyer. 
Suppose the wages ranged from $50.00 to $120.00 per week 
with 8 personnel receiving wages between $50.00 and $60.00, 

10 between $60.00 and $70.00, 16 between $70.00 and $80.00, 

14 between $80.00 and $90.00, 5 between $100.00 and $110.00 
and 2 between $110.00 and $120.00. A possible formulation of 
the corresponding frequency distribution is presented in 

Table 2 with the range from $50.00 to $119.99, in 7 ten dollar 
class interval widths and the total number of personnel listed 


for each class interval. 





Wwaces Dine elep gaeelnd 
(de lties-= Bolte tea 


serecnnel 
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Table 2: Frequency Distribution of the weekly haces 
ion tare of Gs =Ealicsted Personnebebosrcd 


fet he ee SO oe 


a Destroyer 


With reference to Table 2 determine: 
1. The lower limit of the sixth class. Ans. $100.00 
2. The upper limit of the fourth class. Ans. $ 89.99 
3. The class mark of the third class. 
Class mark of 3rd class = 4($70.00 + $79.99) = $74.995 


(For most practical purposes this is rounded to $75.00) 


4. The class boundaries of the fifth class. 
Lower class boundary of 5th class = 4($90.00 + $89.99) = $89.995 
Upper class boundary of 5th class = 4($99.99 + $100.00) = $99.995 


5. The size of the fifth class interval 
Size of 5th class interval = upper boundary of 5th class - lower 
boundary of 5th class = $99.995 - $89.995 = $10.00 


HISTOGRAMS AND FREQUENCY POLYGONS 


Histograms and frequency polygons are graphical representations of 


frequency distributions. 





1. A histogram consists of a set of rectangles having: 
(a) Bases on a horizontal axis (the X axis) with centers 
at the class eae and lengths equal to the class sizes. 
(b) Areas proportional to class frequencies. 

2. A frequency polygon is a line graph of class frequency plotted 
against class mark. It can be obtained by connecting midpoints 
of the tops of the rectangles in the histogram. 

Figure 1 shows a histogram and frequency polygon developed from the data 


listed in Table l. 


RELATIVE FREQUENCY DISTRIBUTIONS 


The relative frequency of a class is the frequency of the class 
divided by the total frequency of all classes, and is generally 
expressed as 2 percentage. For example, the relative frequency of the 
class 66 - 68 in Table 1 is 42/100 = 42%. The sum of the relative 
Geequencies of all classes is 1, or 100%. If the frequencies in Figure 1 
are replaced by corresponding relative frequencies, the figure is called 


a relative frequency distribution. 





Pieure 1: Histosram and Frequency Polyeon of Data 
bie tec lane. 1. 7 





CUMULATIVE FREQUENCY DISTRIBUTIONS AND OGIVES 


The eal frequency of all values less than the upper class boundary 
of a given class interval is called the cumulative frequency up to and 
including that class interval. For example, the cumulative frequency 
up to and including the class interval 66 - 68 in Table 1 is 5 +18 + 42 
= 65, which means that 65 students have heights less than 68.5 inches. 

A table presenting such cumulative frequencies fee calc a cumulative 
Eypegquency distribution, cumulative frequency table, or briefly a 
cumulative distribution, and is shown in Table 3 for the Naval Academy 
student height Bigiweniueien. A graph showing the cumulative frequency 
less than any upper class boundary plotted against the upper class 


boundary is called a cumulative frequency polygon or ogive; an example 


is shown in Figure 2 for the Naval Academy student height distribution. 


Number of 
Students 
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Table 3: Cumulative Frequency Distribution 
of the Naval Academy Student Height 
Dectmaeutlem On 22ble 4 


10 





1CcO 


G. 
© 


60 
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RELATIVE CUMULATIVE FREQUENCY DISTRIBUTIONS 


The relative cumulative frequency or percentage cumulative frequency 


is the cumulative frequency divided by the total frequency. For example, 
the relative cumulative frequency of heights of Naval Academy students 
with heights less than 68.5 inches is 65/100 = 65%, which indicates that 


65% of the students have heights less than 68.5 inches. 


TYPES OF FREQUENCY CURVES 


Many times relative frequency curves are smoothed--that is, a smooth 
curve 1s "fitted" to a relative frequency polygon--and are used to 
represent situations arising in ASW applications, such as probability of 
detection. (Tnese probability distributions are developed in Chapters IV 
through V1.) 


EE 





Some of the more common frequency curves arising in ASW applications 


take on the characteristic shapes as indicated in Figure 3. 


Samet rical or rell-Shared 


5 


Berewea to the Left 
Meeeative Ckeunees) 


pimedal 


” 


Skewed to the Risnt 
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Kultimodal 


bueure 33° Tyres Of Frequency Curves 


~_ 


The symmetrical frequency curves are characterized by the fact 


that observations equidistant from the central maximum have the 


same frequency. An important example is the Normal curve shown 


in Chapter VI. 


NO 


In the skewed frequency curves the tail of the curve to one side 


of the central maximum is longer than that to the other. -If the 


longer tail occurs to the right the curve is said to be skewed 


to the right or to have positive skewness, while if the reverse 





TomuUbWemtENescuLvye ds Said to be skewed to the left or to have 
negative skewness. 
3. A U-shaped frequency curve has a maxima at both ends. 
4. A bimodal frequency curve has two maxima. 
5. A multimodal frequency curve has more than two maxima. 
Supplementary solved problems covering arrays, frequency distributions, 
histograms, frequency polygons, cumulative frequency distributions, ogives, 
and frequency curves are presented at the end of Chapter 2 Schaum's Out- 


line Series, Theory and Problems of Statistics. These problems should 


be consulted if a greater review of the material is desired. 


13 





CHAPTER II: SPECIAL MEASURES OF SAMPLE DATA: 
CENTAL TENDENCY AND DISPERSION 


MEASURES 


Although data relating to observations of events (for example, the 
number of submarine detections in a given region during a specific 
period of time) are important in sewallow ine methods for predicting 
similar events in the future, the data in the raw state are usually not 
directly suitalbe for making these predictions. The data must be 
reduced to a format that is still more manageable than the frequency 
histogram of Chapter I in order to facilitate examination further, and 
to a format from which the pattern of events on which the data are based 
can be determined. Such a representation may consist of "measures" of 
the data. Through the use of these measures, an entire data set can, 
effectively, be summarized by two or three single numbers. As a result, 
these measures are very important, and useful, in that they are devices 
by which sets of sample data can be quickly described. 

Two particularly useful measures of sample data are those which 
indicate or describe a relative central value of the sample data 
(central tendency), and the degree to which the data are spread out 
about a central value (dispersion). A measure of central tendency can be 
utilized to indicate a "common quantity" which can be used to describe 
roughly the entire sample (and possibly even the entire population). 

And a measure of dispersion can be used as an indicator of the Seite 
"error or "spread" that exists when a measure of central tendency is 


used to describe the sample or the population. 
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A digression will be made at this point to introduce two useful 
forms of notation, index or subscript notation, and summation notation. 


e 


SUBSCRIPT NOTATION 


When it is necessary to describe a variable, X, and the values it 
can assume, and also to distinguish between individual members of the 
group (these members are specific values which the variable X can assume—- 
the lst x, the 2nd x, etc.), subscript notation is a concise, useful means 
of describing specific values of the variable. To describe the values 
of the variable X with subscript notation, the first value (lst x) of 
the group is denoted by X4> the second value by Xo and so forth to the 
last, say the nth | value by x: The entire group of values of the 
variable, X, can then by denoted by 

X = (x1, Kyo ree x)> 

where the three dots indicate that middle values are included in the set, 


but are not explicitly written down for brevity. 


SUMMATION NOTATION 


Summation notation is a concise and useful means of denoting the sum 
of all or part of the individual observed values of a variable X. The 
notation used to denote the sum 

esi eee eceeee... tT xX 
1 2 3 n 


aS 


Mo 
w 
j= 


i=l 


The capital Greek letter L (sigma) denotes that a sum is to be taken over 


LS 





the observed values of the variable x. which follow it, and the values 
of Xs that are to be summed are those with subscripts between i = 1 and 
i=n. For example, to sum the third through the fifth observed values 


of X, the summation is written as 


That is, 


5 . 
: Sa x, + xX), + Xo 


This example shows that the lower limit on i need not be 1. Further, if 
the values X can assume (say, a total of six values) are SS Okeke = 3h 


X3 = roy, = 24 xeus a and xg = 9, then 
5 
ees eet = LO 


The following example shows how this notation can be used in an ASW 
setting. Suppose there are a total of six regions each of which can be 
monitored by a single sensor, where the number of possible targets in 
each region is known for given periods of time (from intelligence). It 
is important for the search planner to know the number of Saanell 
detection opportunities available from the monitored regions so that he 
can ensure that sufficient localizing/tracking platforms (ships, aircraft, 
submarines) are on standby for employment upon initial detection. As 

the decision to monitor regions three through five has been previously 
made, he wishes to specify the number of detection opportunities available 
in these regions on a given day in a standard format regardless of the 
particular values entered in the format for a specific period; this is 


because he wishes to include this calculation in a larger computer 
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program. The notation 


is a convenient format for this purpose. In any particular run of the 
program, numerical values for Xa, Ky and x5 must besentered: 
An important property of summation is that if a constant value, k, 


is summed over a range of N indices, then the summation of this 


quantity is equivalent to multiplying the constant k by N, since, for 


example, 
5 
DO KK =a eet KK = aK 
i=2 > nanan Vann 
N= (5 - 2) + 1 = 4 terms 


MEAN, MEDIAN, AND MODE 


Three particularly useful measures of central tendency are the mean, 
the median, and the mode. 

The mean, which is similar to the commonly used notion of arithmetic 
average, is computed by adding the observed values of sample data and 
dividing this sum by the eee number (N) of data points in the sample. 


The mean can be expressed in Summation notation as 


i ae 


l 1 


|e 
m2 


1 
where N is the total number of observed values summed, and X is read as 
"the mean value of X."" (Note that the upper limit on the summation index 


equals the reciprocal of the constant in front of the summation.) For 


example, the mean of the observed values KF Z ee 5, and x, = 1 of x is 
ae 3 
cae Geen ees) (2 5) 1) 8/3 
i=1 


Li 





The utility of this value (X) for X is that it can be taken, in many 
cases, to state a "middle value” of the apcenued data, and is often used 
as the best single number estimate of the values of X. 
For example, suppose it is necessary to search for an enemy submarine 
which is currently maintaining a "known" missile launching station, 
and the following conditions exist: 
1. the "known" missile launch station is actually a large ocean 
region; 
2. the area that can be searched effectively is one-half the size of 
the missile station; 
3. the positions of previous enemy submarine detections within the 
missile launch area are available to the search planner. 
The above information is depicted in Figure 4a below, where the dots 


represent the previous detection poSitions. 





i rocamicmilaninen oo vaturon 


Ficure 4a 
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Fisure 4b @ 


Mean FPosition 


Fisure 43; Submarine Search Information: 

@. rositione of Frevious vbetections; 

bee wean Cearcn frea 
It can be seen from Figure 4a that the positions tend to cluster about a 
certain small enclosed portion of the launch region; therefore, it seems 
reasonable to search about this small area in order to increase efficiency. 
The center of this small region is the mean position of previous enemy 
Submarine contacts. Therefore, to obtain the position about which the 
search should be centered, the mean of the previous positions is calculated. 
The search is subsequently centered about this mean position, as seen in 
Figure 4b. 

A special form of the mean is the weighted (arithmetic) mean. This 
form allows one to account for differing "levels of importance" of the 
individual sample observations. For example, if in a three element sample, 
x, were considered to be twice as important as Xo» and X3 four times as 


important as X95 the weighted mean of the sample would be computed as 
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ue 
my +m, t+ m3 


2x, + 1x, + 4x3 

~2+1=+=4 — 

where the m; is the relative weight, or level of importance, assigned 
to the lay element. 

The median value is the value which exactly divides the sample into 
two equal parts; there are exactly as many observations greater than the 
median value as there are less. When the number of observations is even, 
the median is exactly half-way between the two "center" observations, but 
for an odd number of observations the median value is the center observa- 
tion. 

For example, given the sample 

a 2; nis », Say Je 25h ad 
when arrayed is 1, 2, 5, 7 and has as its median 


Zt 
7S Ws Te 


See 
and we note that this is not one of the values actually observed. 

The mode or modal value of the sample is the most frequently occurring 
value/s in the sample. If more than one value occurs at this same 
maximum frequency (of occurrence), then the sample is said to be multi- 
modal; such a sample would have a frequency curve with more than one peak. 
For example, for the sample x, = 2, x9 = 5, X3 = 2, and x, = 3, the mode 
is x = 2, since it occurs twice in the sample, while all other observed 
values appear once. Although the mode is easy to ascertain from a sample, 
it is probably the least useful of the measures of central pendency, 


treated here. 
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It is seen that each of the three measures of central tendency 
mentioned can be used as an estimate of (or statistic which describes) 
the sampled population; however, in fact, the mean is probably the most 
useful of the three estimators--this is so because the mean is less 
biased than the other two measures. In other words, the mean is not 
influenced by extreme sample values to the extent the median and mode 
are. Therefore, for most practical applications, the mean of a sample 
reflects the central tendency of the parent population more accurately 


than the median or the mode. 


RANGE, STANDARD DEVIATION AND VARIANCE--MEASURES OF DISPERSION 


A useful statistical indicator on sample data and its resident 
population is the spread of the sample points (usually expressed relative 
to, or about, the mean). Such an indicator is called a measure of 
dispersion (spread). 

mieee particularly useful measures of dispersion are the range, the 
Standard deviation, and the variance. 

The range is obtained by taking the difference between the largest 
and the smallest observed value in the sample. While the range of 
sampled data is easy to compute, it iS not aS accurate an indicator as 
the other two mentioned. This is due to the large degree to which one or 
two sample values (a low frequency of occurrence) at relatively extreme 
distances from the center of the greatest portion of the sample can bias 
the estimate of the actual spread (or dispersion) of the sampled 
population. 

The standard deviation OG (lower case Greek sigma) is obtained by 


first computing the sample mean, and then using the formula 


Zi 





ae V [2 5 (x; - ¥)*) 


N i=l 
Since the standard deviation tends to weight each observed value by its 
frequency of occurrence, it is seen to be a more accurate measure of 
dispersion than the range; that is, the standard deviation is not unduly 
biased by a few extreme sample values. 

The variance is found by simply squaring the standard deviation. 
Consequently, the variance has all the advantages of the standard 
deviation. The mathematical form for the variance is 

| n 


— 9 
2 eee, SC =X) 
OMe a ae 


An equivalent form for the variance is 


where 


ma = 


ax. and X = G7 £ x.) 
TL 
i=l i= 


mile 


The latter equation is the 'short method" for computing O* and is the 
recommended method of computing the variance; the positive square root 
of Bile equation is the 'short method" for computing O and is recommended 
for computing the standard deviation. 

As an example which illustrates the equivalence of the two listed 
forms for computing om consider the case where three BT (bathythermo- 
graph) readings are observed and the first shows a sea surface temperature 
of 50°F, the second 48°F, and the third 52°F. The variance for these 


readings is first found by 
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1/3 Goa os Un 259 Gp x)" 


1/3 (0 + 4a 8/3 . 


Oe 


Now, Gr is found by 


ol 
G7 = x2 -xX° , 
where 
ay 3 9 = 2 2 
» ee es Lys ab x Ot to ae 2 =) — 7506) 3 
i 
and 
= - y) 2 
mS (ees x )- = [1/3 (50 + 48 + 52) 
i=l * 
Eee 
= 50° = 7500/3, 
so that 


CG? = 7508/3 - 7500/3 = 8/3 . 
(The second method looks longer than the first, but the mean had to be 
calculated prior to the first method.) Hence, this example illustrates 
the equivalence of the two formulas for Gir 

As an example of the usefulness of the standard deviation, consider 
the Normal probability frequency distribution (which is covered in detail 
in Chapter VI) shown in Figure 5. If sigma (the standard deviation) is 
town for the distribution, the percentage of cases in which any (observed) 
value could be expected to have come from the intervals (of the distribu- 
tion) X -G to X +O we OG oe 2 0) or X - 30 2 CNT is 


68.27%, 95.45%, or 99.73%, respectively, as shown. 
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Mere 5S: Normal rrovability Frequency Uietributions 


As an application, consider a situation where it is necessary to 
obtain the probability of detecting a submarine at a distance of 45 n.m. 
from the detection device (which is denoted by Paet (4 n.m.) ), given 
the FOM (Figure-of-Merit), the pertinent PLP (Propagation Loss Profile), 
and the OG -value EOE Ene probabil@ty of detection. 

The first step is to visualize a Normal proabbility curve whose 
"x-axis'' is located vertically along the 45 n.m. line of the PLP, and 


whose mean value is on the FOM line along the "x-axis," as seen in 


Figure 6. 
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Figure 6; PLP ({Frovragation Loses Profile) 
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Next, recall that the area under the Normal curve to either side of the 
FOM (the mean value for the distribution) Sorresponds to 50% probability 
of detection. Note that the Faet\*> n.m.) is represented by the darkened 
area under the Normal curve and under the PLP curve at 45 n.m., and that 
the intersection of the PLP and the axis of the probability curve is 
at -95 dB or ey relative to the mean (the FOM). Now the Pgo;(45 n.m.) 
is calculated as the difference between 50% (the lower-half of the area 
under the Normal curve) and one-half. of the area (under the Normal curve) 
from mC). Cort 1O (which was previously shown to be 68.27%), or 
Paet (45 n.m.) = [50 - % (68.27)] % = 15.87% . 

(Other methods for obtaining probabilities are presented in Chapters IV 
through VI.) 

Supplementary solved problems covering special measures of sample 
data are presented at the end of Chapters 3 and 4 of Schaum's Outline 


Series, Theory and Problems of Statistics. These problems should be 


consulted for a further review of the material. 
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CHAPTER III: COMBINATORIAL ANALYSIS 


COMBINATORIAL ANALYSIS 


Combinatorial analysis involves application of counting principles 
to obtain probabilities of complex events. These basic principles are 


presented in this chapter. 


THE FUNDAMENTAL COUNTING PRINCIPLE 


If an event can happen in any of n, ways, and if when this has occurred 


1 


another event can happen in any of n, ways, then the number of ways in 


Z 


which both events can happen is n,n,. Example: if 3 submarines are to 


Ls? 
transit 2 barriers, then the number of ways the submarines can transit 


the barriers are (2)(3) = 6 ways; if 5 submarines are to transit 3 barriers 


then there would be (3)(5) = 15 ways. 


FACTORIAL n 


Factorial n, denoted by n!, is equal to n(n - 1)(m - 2)...1. Thus 


Seem) (4) (3) (2) (1) = 120, 4:32 = ((4) (3) (2) (1)) (03) (2) (1) = 144. Tet 
is convenient to define 0! = 1. Further, each factor in the factorial 


product must be a (nonnegative) integer. 


PERMUTATION PRINCIPLE 


A permutation is an arrangement of r out of n objects with attention 


given to the order of arrangement. The number of permutations of n objects 
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taken r at a time is denoted by nPr, P(n,r) or Pn,r and is given by 


eee 2)... ee ee ee 
(n - xr)! 


For example, how many ways may 5 destroyers be placed in a 5-position 
screen? The first position can be occupied by any one of the 5 destroyers; 
that is, there are 5 ways of filling the first position. When this has 
been done there are 4 ways of filling the second position. Then there 

are 3 ways of filling the third position, 2 ways of filling the fourth 
position and 1 way of filling the last position. Therefore, n represents 
the 5 destroyers and r represents the number of destroyers that may be 


used in the screen which also is 5; hence, 


nPr = ___n! = 5! = 5! = (5)(4)(3)(2)(1) = 120 


; = 5. = 5: 
(n - r)! (5 - 5): Q! 1 
Now suppose 5 destroyers are available for placement in a 3 position 
screen. How many ways may the destroyers be placed in the screen? The 
first positicn can be occupied by any one of the 5 destroyers, then the 
second position can be occupied by any of the remaining 4 destroyers and 
finally the last position may be occupied by any of the remaining 3 
destroyers. Therefore, n represents the number of dag eiances 1 oh od oy eo 
and r represents the number of destroyers that may be used which is 3; so 


that, 


nPr = 5P3 =n! = __ 5! = 5! = (5) (4)(3)(2) (1) = 120 = 60. 
(ni“=<4)! (5 - 3): on 


COMBINATION PRINCIPLE 


A combination is a selection of r out of n objects without counting 


the order of arrangement. The number of combinations of n objects taken 
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Meat a time is denoted by nCr, C(n,r), Cn,r or G) and is given by 


ner — Nin... (n= 1. + 1) = ae = nPr 


r ri(n - r)! ae 


For example: Suppose a submarine with six torpedoes approaches a 
convoy of 10 ships. If the ships are selected at random and one torpedo 
is shot at each ship, how many ways may the torpedoes be shot at the 
10 ships? 


nCr = fale OE - (10) (9) (8) (7) (6) (5) (4) (3) (2) 1) = 
Pee) 6. CPOE 6) (65) (4G) (2941) ) (42) 1) 


3,628,800 = 3,628,800 = 210 ways 
(720) (24) 17,280 


Now suppose 3 Sean ships are warships protecting the convoy and it is 
desired to keep clear of these ships. If the remaining 7 ships are 
selected at random and one torpedo is shot at each ship, how many ways 
Mayeune torpedoes be shot at the 7 ships? 


nCr = n: = The = 7 ways 


Ee(ne- ¢): Oy C/2="6)4 
Supplementary solved problems covering permutations and combinations 


are presented at the end of Chapter VI of Schaum's Outline Series, 


Theory and Problems of Statistics. These problems should be consulted 


fore Lurtehner review of the material. 
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CHAPTER IV: ELEMENTARY PROBABILITY THEORY 


ELEMENTARY PROBABILITY THEORY 


The concept of probability is commonly used as a means with which to 
state (or predict) the percentage of possible cases in which a specific 
event will occur; for example, the percentage of events in which a specific 
number will appear on a single toss of a die. The probability of such an 
event occurring is, in many cases, given as the ratio of the number of 
Ways in which the event can occur to the number of ways in which ALL 
events can occur. So the probability that a two, for example, will occur 
on a single toss of a fair die is 


=! 


where the numerator represents the one way in which a two can occur, and 

the denominator represents the six ways in which each of the numbers 

1, 2, 3, 4, 5, 6 can occur; hence, the probability that a two will be 

the outcome of a single toss. of a die is one-sixth, or in other words, 

the ''chances" of this outcome occurring--an event--are "one in six." 
Another method of specifying probability of an event is to state 

the probability in terms of proportional areas. The basis of this method 

is to think of the probability of occurrence of ALL events as an area, 

say "A," and the probability of a particular event is simply the ratio 

of the areas 


P(event of interest) = : 


B 
A 


where B is the area corresponding to the event of interest. 
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Probabilities specified as propcrtional areas can be represented by 
diagrams of the corresponding regions. A diagram of two regions with 


areas "A" and "B'" is shown in Figure 7. 


Meagure 7; Frocortional Areas for Frobability Analosy 


This figure facilitates understanding the probability of event "B" by 
illustrating the area which is assigned to a region relative to the 

total area, "A," assigned to the occurrence of ALL events. Since the 
diagram visually represents the probability of the event, it is a useful 
means of intuitively developing the concept of probability. (A carefully 
cultured intuition can be very useful when dealing with probability.) 

It is necessary here to define some terms which will make the 
formulation of basic probability theory easier. The first of these EaeaS 
is PROBABILITY EXPERIMENT, which is defined as a type of trial procedure 
for selecting a specific outcome. An example is that of tossing a die 
to see what number comes up and noting the occurrences (or of flipping 
a coin to see whether "heads" or "tails'' occurs). 

Next, a SAMPLE SPACE is the totality of emeeones which can occur 
for the probability experiment. In terms of plane regions, the sample 
space is the large region ("A" in the previous example) which contains 


all possible outcomes of a specific probability experiment. 
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An OUTCOME is a specific value (or set of values) resulting from a 
single Re enausiion of a probability experiment, and it is the basic 
element (or smallest unit) of the sample space (for the specific 
probability experiment). An example would be the occurrence of one 
specific permutation (of the 216 possibie permutations) of the numbers 
appearing on the three die in the (probability) experiment of tossing 
three dice on a single toss and noting the numbers which appear, where 
the goal of the experiment is to see whether or not a one, a two, and a 
three appear on the toss. This can be represented as one of the totality 
of points that make up the sample space (a three dimensional space, as 
each outcome is composed of three numbers). 

An EVENT, on the other hand, is a set of outcomes; in the previous 
example, an outcome is the sequence of the three specific numbers which 
@ccur as a result of the toss of the three dice, and the event consists 


of six points of interest (outcomes) in 3-space, as shown in Table 4. 


Dew sl Die #2 Die 75 
( 1 : 2 ; 3) 
(1 — 3 : A) 
[2 : 1 , on) 
( 2 ; 2 ; 1) 
( 3 ; 1 : 2 ) 
( 3 / 2 ; i) 


iemle 4: The Six Cutcomes That Comprise (Saticfy) 
. the Even. On Cotainine- a Cne, a Two, and 
ae iaree On @ Sinzle Toss of Three Dice 
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In terms of the proportional areas analogy, an event is an area ("B" in 
the earlier example) which contains one or One outcomes and is a Ssub- 
area of the sample space (NA). An extension of this terminology is the 
term JOINT EVENTS, which describes a combined event in which two or more 
Simple events occur on the same iteration of the experiment. 

In summary, the sample space is comprised of all possible outcomes 
of a probability experiment, and these outcomes are grouped into sets 
called events--outcomes are the elements of an event. 

The remaining basic concept in the theory is that of understanding 
the PROBABILITY FUNCTION. A probability function is a device by which 
a number (or area) in the sample space is assigned (as previously 
mentioned) to an event. An example of a probability function is the 
assignment of the value 1/3 to the probability of each of the events 
that a single submarine, during a single transit, will cross one of 
three barriers each of which the submarine is equally-likely to cross. 


This is written as 


P(SS transits #1) = 1/3 
P(SS transits #2) = 1/3 
P(SS transits #3) = 1/3 , 


which can also be represented in an area diagram as in Figure 8, where it 
is seen that all three areas are equal and that no area overlaps any 
other, which is interpreted to mean that the submarine can cross only 


SmewoL the barriers during a single transit. 


Sy, 





Area 1 Area 2. Area 3 
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Figure 8: Frcportional Area Depiction of 
Bquedi=likelinood Frooability Berriers 
There are basic restrictions on the probability function. One of 
these is that the values assigned to events by the probability function 


mist SUm CO one, Or 


where i corresponds to the aoe event. 
Another is that the values assigned must be between zero and one, or 
0 = PP, = ae 
This restriction is a result of the fact that probability is a ratio of 
a small area to a larger one. Also, since a specific event in a given 
experiment will either occur or not occur, the sum of the probabilities 
that the event will and the event will not occur must equal one. This is 


represented mathematically as 


P(Specific event occurs) + P(Specific event = l 
does not occur) 


or by rearrangement 


P(Specific event does not occur) = 1 - P(Specific 
event occurs) 


or 


P(Specific event occurs) = 1 - P(Specific event 
does not occur) . 
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As seen by the last she arrangements of the first equation, the probability 

of an event occurring and the probability of the same event not occurring 

on the same trial of an experiment are complements of each other in the 

probability sense; that is, the probability of an event occurring can 

also be described by one minus the probability of the same event not 

occurring. As will be seen later, this idea of PROBABILITY COMPLEMENT 

of the occurrence of an event has many useful computational applications. 
We now pass from probabilities for single events to probabilities 

for several events. There are two cases of interest: the probability 

of joint events, and the probability of at least one of two (or more) 

events. Joint events are denoted by the word "and''; for example, the 

probability of events A and B occuring. The other case is denoted by 


the word "or"; 


for example, the probability of event A or B (or both 
occurring, where A or B or both is the sense in which it is used in this 


course). This "or' probability is shown by the proportional areas 


analogy as in Figure 9. 





iwemire GS. Probab lity of Events 4 “or 2B 


it is seen from this figure that the probability of event A or B is 
represented as the probability of A plus the probability of B, minus 

the probability of A and B occurring. The reason for subtracting the 
joint probability of A and B is that the area corresponding to the joint 


probability (''cross-hatched" in the figure) is common to both event A and 
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event B, so that when area A is added to area B the "cross-hatched" area 


is added twice; therefore, the joint probability must subsequently be 


subtracted once. The probability of A or B occurring is therefore expressed 


mathematically as 


P(A or B) = P(A + B) 


P(A) + P(B) - P(A and B) 
P(A) + P(B) - P(A ° B). 


Figure 10 shows a case in which events A and B have no area of overlap; 


that is, A and B do not have any area in common. 
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It is easily seen from this figure that any outcome in event A cannot 
possibly also be a part of event B; in fact, the existence of the outcome 
iieeiener A Or B precludes its simultaneous existence in the other event. 
Two such events are said to be MUTUALLY EXCLUSIVE. 

The joint probability of A and B can be shown through the proportional 


areas analogy as in Figure ll. 





eeure Wis Tse Joint Frobebility of Events a und B 
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It is seen from the figure that the probability of the joint event is 

the ratio of the "cross-hatched" area (the joint event itself) to the 

entire area (which is equal to one since it is a probability space). 

Mathematically the joint probability of events A and B is represented as 
P(A and B) = P(A - B) = P(A) + P(B) - P(A + B) 


(This is simply a rearrangement of the "or" 


probability equation developed 
previously.) 

The symbol P(B A) denotes the CONDITIONAL PROBABILITY of event B 
given that event A has already occurred; that is, it is the probability 
of B on the condition that event A has occurred previously. It is seen 
from Figure 12, that P(B A) differs from P(B) by the fact that the 
sample space (S) from which B is drawn is larger than the resultant 
sample space (S') from which B is drawn when A has been given to occur 
on this same outcome (the event "B given A"). This is due to the fact 
that S has been reduced to S' after it is known that the event yet to 
occur (B) is contained in the event that has already occurred (A); 
therefore, in terms of the ratio idea, the denominator has been decreased, 


thereby increasing the probability of B for the conditional case (relative 


to the unconditioned case). 


OQ I=-O 


S'- A 
Meme t2, COmrariccon of Unconciticned Frobability to 
Goncittenal (or Conditicned)srrobability 


CONDITIONAL PROBABILITY actually epitomizes the usefulness of probability 


to ASW. That is, generally, an area is searched for the presence of a 
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submarine, and both the cases in which the submarine 1) has or 2) has not 
been detected as a result of previous search efforts are important 
information. Each allows the searcher to refine his prediction of the 
target's position (in most cases) to a smaller and sales portion of 
the original search region and, thereby, increase the probability of 
‘' detecting the submarine on subsequent portions of the search. It is 
obvious how this occurs in the case where the target is detected on a 
given previous effort, but the case of no detection on previous search 
efforts may not be so obvious. The fact that the submarine was not 
detected on previous portions of the search allows the searcher to 
eliminate (again, in most cases) that portion which has been searched-- 
similar in idea to "sSterilization'"--thereby conditioning the original 
sample space (this should be visualized as multiple iterations of the 
process depicted in Figure 12). Having previous knowledge (intelligence 
data) which allows the searcher to eliminate portions of the (original) 
search area is another way in which this conditioning can occur. 

The joint probability of events A and B (discussed previously) can be 
represented as | 

P(A and B) = P(BIA) - P(A) . 

This equation has more intuitive appeal than the previous equation for 
joint probability, as it is simply an extension of the "Counting Principle"; 
it indicates that the probability of both event A and B occurring is the 
BeObability that event A occurs and "event" B given A occurs. 

From the last equation it is seen that if P(A) 0, then the 
conditional probability of B given A is mathematically represented as 


PCBpADm= PCA - By / P(A) 
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Three important methods for computing conditional probabilities are: 
1. By utilizing the definition of ae probability 
P(BJA) = P(A ~ B) / P(A) 

For example, it is known (by some means or another) that a submarine 
skipper will not choose one of three known and equally-likely barriers 
to transit, then the resulting probability is increased beyond that for 
the original choice due to the sample space drawn on having been reduced 
from three possible paths to two. -If the barrier eliminated is #1, then 
the resulting sample space S' consists of barriers #2 and #3, vice the 
original sample space S which was made up of barriers #1, #2, and #3. 
Therefore, the probability that the submarine will transit barrier #2, 


given it will transit either #2 or #3 is seen to be 


P(A - B) =P | #2 or #3) and (62)| 


P(BJA) = 
P(#2 or #3) 
= P(i#2) since the only way in which #2 and 
P(#2 or #3) (#2 or #3) can occur is if #2 occurs 
= 3 = 1/3 = 1/2 
Sek 1/37 273 


2. By simply writing the conditional probability down, as sometimes 
the information that A has occurred enables the determination of the 
probability of B. 

For example, in the experiment of drawing two cards from a deck of 
playing cards one at a time without replacing the drawn card in the deck, 
it is known that the first card drawn is a Spade. Given that the first 
card is a Spade, the probability that the second card is also a Spade 
can immediately be written down as 

P(draw #2 is a Spade draw #1 is a Spade) = 12/51 , 
Since the original 13 Spades in the deck have been reduced to 12, and 


the original 52 total cards (the sample space) to 5l by the first draw. 
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The probability thus formed is consistent with the ratio concept of 
probability.. 

3. By utilizing Bayes' law, which will now be presented without 
proof and with only the following restriction. 

Bayes' law holds for experiments on sample spaces which consist of | 
definite groups of events that do not “overlap.'' Such a sample space is 


seen to correspond to the delimited area/s in Figure 13. 





items 4 frartitioned Sample @meace 


Such a sample space is said to be PARTITIONED. 
Bayes' law states that on a partitioned space the probability of an 
event A occurring, given that event B has previously occurred is, 


P(BJA) - P(A) + P(BJAS) - P(AS) 


where P(A‘) is the probability complement (covered earlier) of P(A), or 
P(AT) = 1 - P(A). 

For example, suppose it is known from intelligence reports that 
missiles of type C and D are carried by exactly two classes of Soviet 
submarines, say class A and class B. Submarines of class A are known 
to carry 16 missiles of type C and eight of type D, while those of 
class B are known to carry ten of each. A missile is fired from a 
submarine and observed to be of type C. Find the probability that the 


missile was fired from a class A sub, under the following conditions: 
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1. Each missile to be fired is chosen at random from the total 
supply of missiles aboard the sabeeeines and 
.2. the Soviets normally operate twice as many subs of class B as 

of class A in the area. 

The problem is to find P(A C); it can be found using Bayes’ law. To 
determine the values to be used, it is best to first draw two blocks 
representing the class A and B submarines and the types of missiles they 
carry as shown in Figure 14. (Remember that there are two class B subs 


for each class A.) 





ORS rea 
Oe moe 
oO mies 
Class A submarine Two Cleése 5B submarines 
Ficure 14 


By Bayes' law then 


» P(CIA) - P(A 
(alc) P(C{A) + P(A) + P(CJAS) + P(AS) 


here AS = B; therefore, 


= P(CIA) - P(A 
P(C{A) * P(A) + P(C|B) - P(B) 


if (16/24) - (1/3) Ss A = OS 
(iG) 24) se 1/3) + (10/20) + (2/3) 5/9 


As a second example of the application of Bayes' law, consider a 
circuit breaker used to arcexeacis a certain piece of ASW hardware from 
short circuits. At the moment the "on" switch is thrown to activate 
the equipment, the circuit breaker essentially "tests" the circuit for a 
Short. If a short is detected, the circuit breaker should immediately 


trip; otherwise it should allow operation to continue. 
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Suppose it is known that although short circuits do not always trip 
the circuit: breaker, they do trip it 99.5% of the time. On the other 
hand, suppose the probability that the circuit breaker trips when there 
is no short is 3%. If the probability that there is a short in the 
equipment when the switch is thrown is 0.01, find the conditional 
Somepability that there is no short circuit given that the circuit breaker 
does not trip. 

The technique of solution involves several steps: 1) describe 
mathematically the probability "events" involved; 2) formulate what is 
given and immediate consequences of the given; 3) formulate the unknown; 
4) write down Bayes' law as it applies to the events of this problem; 

5) substitute the given numbers and solve. 

1) Let: T be the event that the circuit breaker trips; 

T° the event that the circuit breaker does not trip 


tt in 


S the event that there is a short circuit when the "on 


switch is thrown; 


S° the event that there is no short when the "on"! 


switch is thrown. 

2) We are given that 

Eels) 700995, ECrIS-) = 0.03, P(S) = 0.01. 

From the given it follows immediately that 

p(Tc{s) = 0.005, P(Tc|s*) = 0.97, P(SS) = 0.99. 
3) The problem is to find 

P(s¢| T°) 

4) By Bayes' law, 


Sens USS) 2 EIS 
PST) = Saepsey- Bey F PCTS) PS 


5) Substitute: 
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0.97) (0.99 


P(s“[T°) = 7797) (0.99) + (0.005) (0.01) 


eee 2005 = oo garis 
0.9603 + 0.00005 


STATISTICAL INDEPENDENCE 


Another important and useful concept is that of STATISTICAL 
INDEPENDENCE. Two events are called statistically independent if know- 
meaeenot the occurrence of one does not affect the probability of 
occurrence of the other. What this means for events A and B, where 
P(A) > O and P(B) =O, is that P(BIA) = P(B) and P(A|B) = P(A). From 


conditional probability it is known that 


P(BIA) = P(A) Or 
P(peence yy = PCE A= PCRS P(A), 
but since P(BIA) = P(B), it is said that A and B are statistically 
independent if 
P(A * B) = P(A) *° PCB) 

Pormecxample, if an air screen barrier and a destroyer barrier are 
placed about a convoy in such a manner that the probability of detection 
on one of the screens does not affect the probability of detection on 
the other screen, then the joint probability of detection of the submarine 
in both screens is simply the product of the probability of detection 
from each of the screens. Therefore, if the air screen barrier has a 
0.50 probability of detection and the destroyer screen barrier has a 
0.30 probability of detection, then the Peete that the submarine 
will be detected on both screens is 


P(Detection on both ASB and DSSB) = (0.50) (0.30) = 0.15. 
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This probability is, of course, contingent upon the submarine attempting 
to penetrate the screens. (The fact that fae joint probability is 
smaller than either of the (marginal) probabilities which comprise it 
may, at first, seem intuitively incorrect; that is, it is easy to 
misinterpret the statement of what probability is being sought in such 
a problem. However, when it is realized that what is sought here is 
the probability that both platforms simultaneously hold contact on the 
submarine, the smaller resultant (joint) probability then seems more 
intuitively consistent.) 

A major distinction between statistically independent events and 
mutually exclusive events is that the joint probability of the mutually 
exclusive events is zero, since the occurrence of one of a group of 
mutually exclusive events precludes the (simultaneous) occurrence of 
any other event in that group. The joint probability of independent 
events, on the other hand, is clearly not zero unless at least one of 
the independent events has zero probability of occurrence; If A and B 
are independent, then 


PCA)? (EG) = OF 


P(A and B) 


if and only if P(A) = 0 or P(B) Gz 


Supplementary solved problems covering elementary probability theory 
are presented at the end of Chapter 6 of Schaum's Outline Series, Theory 
and Problems of Statistics and at the end of Chapters 3 and 4 of Schaum's 
Outline Series, Theory and Problems of Probability. These problems should 
be consulted for a further review of the material presented in this 


chapter. 


43 





CHAPTER V: RANDOM VARIABLES 


RANDOM VARIABLES 


A variable which assumes a numerical value determined by the out- 
come of a probability experiment is called a random variable. Actually 
a random variable is a shorthand notation for listing each possible out- 
come of an experiment. For example, if a coin is tossed two times and 
the number of heads is counted, then the random variable may assume the 
momme O, L, or 2. 

There are two types of random variables, discrete and continuous. A 
discrete random variable is a random variable X on a sample space which 
has a finite or countably infinite number of outcomes; say, X(S) = 


X1> X9,--+5X We can make X(S) into a probability space (conditions 


ae 
on this are listed with the probability distribution below) by defining 
the probability of x; to be P(x = i), which may also be written f(x,). 
This function f on X(s), defined by £(x;) =r CG =e) owed led sie 
distribution or probability function of X and is usually given in the 
form of a table, as shown in Table 5. (It should be noted that in the 


discrete case the distribution is the probability function developed 


in Chapter IV.) 





ro 
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The distribution f satisfies the conditions: 
n 
Gwe. 0 and. yo f(x) = 
1 : i 
i=l .- 
For example: suppose a pair of fair dice is tossed. We obtain the 
finite equiprobable space S consisting of the 36 ordered pairs of 
numbers between 1 and 6: 
or eml sl). Ci 2),. (Cl hs cag COO) 
Let X assign to each point (a,b) in S the maximum of a and b; that is, 


X(a,b) = MAX (a,b) = X(S) = 1, 2, 3, 4, 5, 6. We compute the distribution 


feo X: 

f(1) = P(X = 1) = P( 1,1 ) = 1/36 

my — P(X = 2) = P( (2,1), (2,2), 1,2) ) = 3/36 

meee P(X = 3) = P( (3,1), (3,2), G,3), (2,3), C,3) ) = 5/36 


and similarly £(4) = P(X = 4) = 7/36, £(5) = P(X = 5) = 9/36 and 
£(6) = P(X = 6) = 11/36. The information is put into the form of a 


table as shown in Table 6. 





Ta lemOcmeDAStribution £01 x 


It can also be seen that f(x;) 2 0 and 
6 
L£(x;) = 1/36 + 3/36 + 5/36 + 9/36 + 11/36 = 36/36 
i=l 
A continuous random variable is one in which the set of numbers 


i 


X(S) is an interval or a set of intervals. If the interval (a S X = b) 
is an event in S, then the probability would be P(a = X = b) and is 


Shown in Figure 15. 
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Figure 15: Fla £ X €b) = Area of Shaded Rezion 


Where P(a = x = b) corresponds to-the area under the graph of £ between 
x = a and x = b. In the language of calculus, P(a = x= b) [eo ax, 
which in effect can be approximated by adding up areas under cee 
between a and b. For example, take the area shown in Figure 15 and 
partition it into smaller areas as shown in Figure 16a and b, and then 


sum the areas of the resulting rectangles. 





Ficure 16a 





Firure 16b 
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An approximation of the area of each subregion of Figure 16a and 16b 
can now be made by taking the area of the rectangle which has the same 
base as the subregion and height equal to the minimum height for that 
subregion. By adding up the areas of the approximating rectangles, an 
approximation is now obtained for the area of the total region between . 
a and b. The area that has not been included is the shaded area, which 
makes the results only an approximation. It can be seen that, due 
to the increased number of partitions each with a smaller base of 
Figure l6b, the shaded area is smaller, which results in a better 
approximation. Actually the greater the number of partitions, the 
smaller the shaded area, and the better the approximation. If f(x) is 
known, the exact area between a and b may easily be determined by using 
calculus. However, it should be noted that the use of calculus 1S not 
required for most practical cases of interest in this course, as values 
are obtained from standard tables. Standard tables exist for most 
continuous random variable distributions used in ASW applications; there- 
fore, it is not intended that the student have a working knowledge of 
calculus. The Poilow ine example is given, therefore, simply to show an 
appreciation of calculus: 


Let X be a continuous random variable with the following distribution: 


f(x) 


ot Oo = x = 2 
f(x) = 
0 elsewhere 


Paacuire ol joer he Of of 
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b | ees, 2 
Maen P(L ~ X = 1.5) a ore -[ 4xdx = 


7 = 2.002) - 229. = 3125). 
a L 





1 
where it is seen that the calculus operation of integration has been 
utilized to "sum" the area of the region under the curve between 1 and 


1.5 as shown in Figure 17. 


DENSITY FUNCTION 


The function that describes the probability of each interval on the 
x-axis (for example, the bimodal function, f(x), in Figure 15) is called 
a continuous probability function or simply the (probability) density 
EwmetLton of X. The fenoey function must satisfy the constraints that 
f(x) = O for all x, and that the total area under the density function 
must equal 1. Special notice should be given the fact that the 
probability of a point-value of the random variable X is strictly equal 
to zero, even though the value of f at the point x may be non-zero. An 
intuitively appealing explanation of why the probability of a single 
outcome of a continuously distributed random variable is zero is given 
in the following example. 

Consider an attack submarine (SSN) searching for enemy submarines. 
The probability that an initial detection bearing will be exactly some 
prescribed angle is zero. The key word here is exactly. For example, 
the word "exactly" would mean that a prescribed bearing of 30° is 
exactly that right down to every decimal place. If the "exactly" is not 
used, then "30°" may represent a whole event, which would have a positive 
probability. For example, if we are rounding off to the nearest degree 
and the density function is constant over the interval from 0° to 260.4 


then "30°" would mean the event from 29.5000...° to 30.499...9, which has 
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probability 1/360. If we are rounding off to the nearest minute, then 
"300" refers to the interval from 29°59.500...' to 30°0.599...', which 

has probability 1/21,600. If we round off to the nearest second, then 
"30°" is the event from 29°59'59.500" to 30°0'0.499...", which has 
peobability 1/1,296,000. “Exactly 30°" is a single point, not an interval, 
.and its probability must be smaller than the probability of every 

interval that contains it, no matter how small. Thus the probability of 


a bearing angle of "exactly 30°" is zero. 


CUMULATIVE DISTRIBUTION FUNCTION 


Let X be a random variable, x a real number, and F(x) the probability 

that X takes on values less than or equal to x: 
F(x) = P(X ~ x) 

Then the function F is called the cumulative distribution function 
(CDF) of X. CDF values are of particular importance in ASW in their 
application to search regions. For example, (in the continuous random 
variable case) it is important to know what the probability of contact 
on a target along the search length, L, is up to and including the 
point X = L/2, where exactly half (or 1/3 or 3/4 or some other portion) 
of the area has been searched. This type of information is essential to 
allow reasonable allocation of available search effort. Further, the CDF 
of the target evaluated at the x, value corresponding to that point 
half-way (in this example) through the search region represents this ° 
desired probability (cumulative), as seen in Figure 18, where, for purposes 


of generality, x is taken to be the middle or center x value of the search 


region's axis. 
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For example, using the data shown in Table 6, the cumulative distribu- 
tion function F of X would be as shown in Figure 19 which consists of 


summing the frequencies, f(xj;), corresponding to those values of x less 


than or equal to x;. 





Peoure|os “Grapn of 


Observe that F in Figure 19 consists of a sequence of horizontal line 
Segments and two rays. One ray coincides with that part of the x-axis 


to the left of x = 1 because P(X = x) is 0 if x «~ 1. There is a jump 


in the graph at x = 1 because F(x) = 0 to the left of 1 and F(1) Si ae 


The graph has constant height F(x) 


W/36etor.1. > s2e 2. Atix = 2 
there is a jump of 3/36 (or £(2) = 3/36) bringing F(x) up to F(2) = 4/36. 
As we proceed along the graph from left to right, we find a jump at each 


preerer x —- tL, 2, 3, 4, 5, 6. For x = 6, the graph has another ray 
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extending indefinitely to the right because F(x) = P(X= x) is 1 if X is 
any real number greater than or equal to 6. If we desire to find the CDF 
for a number between the integer values, for example F(5/2) we can see 
from Figure 19 that F(5/2) = 4/36. To compute the CDF of a continuous 
random variable, it is again necessary to use calculus; however, its 


. values can usually be obtained from a standard table. 


MEAN, VARIANCE AND STANDARD DEVIATION 


The mean, variance and standard deviation are also used to describe 
a random Per iail &: These will be shown here for discrete random variables 
only, as the values for continuous random variables will be found from 
tables. 

The mean or expectation of X, denoted by E(X) (or simply E or Wl ) 
is defined, for discrete random variables, by E(X) = xy E(x.) ax x, £(x,) + 


coe x EO) Xs f(x, ) 


1 


hms 


i 
That is, E(X) is the weighted average of the possible values of X, 
where each value is weighted by its probability. For example, to compute 
the mean of X for the data of Table 6, 


E(X) = 
1 


t MD 


xi f(x.) = 1(1/36) + 2(3/36) + 3(5/36) + 4(7/36) + 5(9/36) + 
1 


6(11/36) = 161/36 = 4.47 
The mean of a random variable X measures, in a certain sense, the "average" 
warue of X. 

The variance of X measures the "spread" or "dispersion" of X. Let X 
be a random variable with the distribution shown in Table 5. Then the 
variance of X, denoted by Var(X), is defined by Var(X) = ; (x, - Hh) £(x;) 


i=l 
=EC(X - pl )* ) where PP is the mean of X. For example, using the data 
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from Table 6 Var(X) = x (x, - pr) £(x,) = (1 - &.47)% Lyf Sie) ae 
(2 - 4.47)? 3/36 + (3 7 es 5/36 + G - 4.47)2 7/36 + (5 - 4.47)? 9/36 + 
(6 - 4.47)2 11/36 = .334 + .508 + .3 + .043 + .07 + .715 = 1.97 . 

Since the standard deviation of X, denoted by Ons is the 


(nonnegative) square root of Var(X): Ge = V Var(X) . For example, 


using the data from Table 6, the standard deviation 


O,, = Y var(X) = Y1.97 = 1.4. 
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CHAPTER VI: BASIC STATISTICAL DISTRIBUTIONS 


BACKGROUND 


Generally, systems which arise in solving many practical ASW problems 
must be describable in terms of a simple mathematical model, and in most 
interesting cases this mathematical description is in probabilistic 
(rather than deterministic) terms. Before a problem can be analyzed for 
probabilistic solutions the basic unknown quantities must be represented 
as random variables (which constitute the mathematical model). Once 
this is done, the density function or the CDF is determined, and the 
system is thereby completely described in probabilistic terms. The only 
task remaining then is to compute the probabilities which constitute a 
problem solution, and this in itself is theoretically straight forward. 
However, it should be noted that some problems can be modeled more 
easily than others; yet an important advantage of this method is the 
possibility of describing many ASW problems in terms of a few "well 
known" random variables. The distributions which are developed here are 
the Bernoulli, Binomial, Chi-Square, Uniform, Normal, Bivariate Normal, 


Time-Varying Normal, Exponential and Poisson. 


BERNOULLI DISTRIBUTION 


A random variable X with exactly two possible values, say 1 and Q, 
where the probabilities P | x = 1 = p and P [x = 0| =1-+pe=q, is 
called a Bernoulli random variable. Such a random variable is important 


because it may be used to describe any random experiment with two possible 
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outcomes, such as target detection from a single ping of an active sonar. 
Indeed, it is the Bernoulli distribution that is used in making a decision 
requiring a "Yes" or "No" answer. However, for our purposes Bernoulli 
random variables are used primarily as the basis for the Binomial 


distribution, which is discussed next. 


BINOMIAL DISTRIBUTION 


The Binomial distribution uses the two alternatives of the Bernoulli 
@istribution, P| x = a = p and P [x = of = q, and an n-fold repetition 
of the basic Beet e tate: The problem now is to find the probability of 
exactly i successes in n repeated independent trials; for example, the 
probability of exactly two detections in six pings of an active sonar, or 
the probability of one detection in four sweeps of a radar. The Binomial 


random variable with parameters n and p is the discrete random variable 


given by P | x = i Gee ipaq) = Oil; 20288) gum, and) P | x= i = 0, 
otherwise. (Recall that (*) was defined in Chapter III as ene 5 
EAL Gaye Seay be 


In order to use the Binomial distribution to find the probability of 
exactly 2 (or i) detections in 6 (or n) pings of an active sonar, the 
probability of a single ping success must be known. For example, if 
the probability of success is 0.6, or p = 0.6, then q =1 - p = 0.4 and 


therefore P | x = 2| = G) pigh-i = o (0.6)? (0.4) 2 = 


mee __ (0.6)* (0.4)4 = 


2G Oo 
ee (06) 0256) 2 
BENG - 2) me ay, oe) Kk ) 


6 

el; Uo tees 

wie) (0.36) (0.0256) = 0.138 . 

The mean of a Binomial random variable with parameters n and pis pPf= np, 
the variance is oh = npq and the standard deviation ‘er = npq. In 


the above example, pf = np = (6) (0.6) = 3.6, Ge = npq = (6) (0.6) (0.4) = 


1.44 and O = V1.44 


1.2. With a probability of 0.6 and 6 pings, we 
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would expect 3 to 4 detections from every 6 pings on the average; for 
this sraetbileann QO indicates, with a 68% confidence, that the number of 
detections from 6 pings is between 2.4 and 4.8 on the average. 
The following problems show how the binomial distribution can be used; 
1. A P-3C aircraft has been loaded with eight active sonobuoys 
which are known to have a 20% failure rate. These buoys are to be 
utilized in a situation requiring that failed buoys be replaced 
immediately to maintain pattern integrity. As this requires fast, 
coordinated action by the aircrew to reposition the aircraft for ''delivery" 
and to prepare replacement buoys, it is important for them to estimate 
the number of buoys requiring replacement. Determine the probability 
that, out of four sonobuoys selected for a pattern, 
(a) 1 sonobuoy will fail 
(b) O sonobuoys will fail 
(c) 2 sonobuoys will fail 
(d) at most 2 sonobuoys will fail. 
(Note that part (d) can be simplified through use of parts (a), (b), (c), 


and the probability complement concept.) 


Answer: (a) P |x = 1 Go .8)- Gn) (OZone 


ace) 


(4) (0.2) (0.51) = 0.4096 
(b) 0.4096 
(c) 0.1536 
(a) Pixs 2| =i -P[x>2| : 
1 - [ (3) (0.2)3 (0.8)! + (i) (0.2)4 (0.8)9 | = (0), S72E 
2. If the probability of damaging a submarine with a single depth 
charge from a destroyer is 0.5, what is the probability that 6 out of 10 


depth charges will damage the submarine? 
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Answer: Q.21 
3. Compute the (a) mean, (b) variance, aaa (c) standard deviation of 
a binomial distribution in which p = 0.7 and n = 60. 
Answer: (a) 42 
(b) sak2a6 
(c) Sess 
4. Consider a box of 200 fuses. It is known that for each fuse the 
probability of being defective is 0.02. What is the probability that 
at most six fuses are defective? (Set up the problem but do not carry 
out the calculations; although the solution is not too difficult if one 
has the use of a pocket calculator. It will be seen later that a good 
approximation to this answer can be obtained much easier by means of 


the Poisson approximation. ) 


Answer: 6 ; . 
Pp | x = 6| = y ——m pign-2 
i=0 nee a = i). 
6 | 
= ¥ 200. (0.02)? (0.98)290 - i 


Sy UA@OOm=E) 


It 


0.8914 


UNIFORM PROBABILITY DISTRIBUTION 


The Uniform probability distribution represents situations in which 
any two sub-regions of the same size have equal probabilities, as explained 
below. This distribution is shown in Figure 20, which shows that the 
random variable X may assume only values netweenen = a and X = b, or 
= 


a X= b, with non-zero probability (the probability of any region 


Sitesidade the interval a= X= b is zero). 
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mmeece 7Ot Uniform Frob2bility Distribution and Ite Cir 


Since the area under a probability distribution must equal one, height 


of the density function between a and b must be il . The mean value 
b-a 
of a Uniform distribution coincides with the central x-value, and is 





(b + a)/2. The variance is (a - b)2/12. The figure also shows the 
cumulative distribution function for the Uniform distribution. 

A Uniform distribution is described by writing "Uniform (a,b) or 
U(a,b ) distribution"; for example, a Uniform distribution having its 
non-zero probability values in the interval 0 to 1 is denoted as "a 
Uniform zero, one distribution." 

An example Peoicaeson of the Uniform random variable is the 
probability Deel sion of the location of a submarine in a narrow or 
restricted body of water. If it is known that the submarine will not be 
in water less than 100 feet deep, then the probability that the 
Submarine will be in such regions is zero. If there is no information 
to indicate any preference in location, it must be assumed that the 
probability distribution for the location of the submarine is uniform, 


as shown in Figure 21. 
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Pisure 21: Frofile of a Narrow Chennel and Associated 
Ooerorm rreeability Listes suvaen 


Suppose that a = 1,500 yards and b = 20,000 yards from a reference 
position (at one of the restriction boundaries, for example). The 
probability that the submarine is located in any subinterval, say of 


width 1,000 yards, is 


Mea x =| = Gl os) ae ee 
Ca) 
= (1,000 yards) ee yards 
= 0.05 


Special note should be made of the fact that the dimensions (yards, in 
this example) associated with the physical problem must cancel so that 
the resultant probability is a dimensionless number. The probabilities 
of any two intervals of the same length within the interval from a to b 
are equal; that is, 
ies K=a| -p[e= x= é| if d-cH=f£-e, 

miememboth cs X= dand e= X= f£ are within a= X= b. 

Suppose that two sensors at separate locations each produce a line of 
bearing (LOB) to a submarine target. Each of the LOB's has a bearing 


emtEor of ten degrees @ 5°) associated with it. Suppose further that the 
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LOB's intersect in a plane region "centered" 117.5 n.m. from the first 
sensor and 66.5 n.m. from the second. The information available to a 
prosecuting platform is chews fine location of the submarine is Uniformly 
distributed over the plane region. Since a bearing error of 1 degree 
represents a range error of about 1 n.m. (perpendicular to the LOB) at 
a range of 60 n.m., the area is a trapezoid as shown by the "cross- 


hatched"' region in Figure 22. 





Eeeorox,. CO ml. 


Nomen ve BO SARA 


A 
Sensor Fi 


Sensor # 


Peeelre 22;  irarezoid Froduced Ey Iwo Sensors 
Suppose sensors #1 and #2 are separated by 135 n.m. and the LOB's are 
60 degrees from sensor #1 and 330 degrees from sensor #2; then the 
trapezoid approximates a rectangle with sides 20 by 12 n.m.; therefore, 
2 


the probability region has an area of about 240 n.m.“©. If a sonobuoy 


with a 3 mile radius of detection is dropped in the probability region, 
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(a) what probability of detection does this sonobuoy yield? 


(b) what probability of detection would six nonoverlapping 
sonobuoys yield? 
Answer: (a) Probability of Detection = area of sonobuoy 
area of trapezoid 


ike ee 
240 240 








(b) Probability of Detection = 0.71 


NORMAL DISTRIBUTIONS 


THE NORMAL DISTRIBUTION 

The Normal distribution is probably the most important continuous 
random variable used in ASW. It is used to describe the sum of a large 
number of statistically independent random variables. The Normal 


probability density function is defined as: 


0 : 2 Zz 
£(X) = e a(x lM) iG) for all real numbers x, 


it 
Cay" 
where ff and te are the mean and variance. We denote the Normal 


distribution with mean M and variance G* by NCU , GO). Figure 23 


shows a Normal distribution of N(0,1). 


2 





ieuremes: ehownal Distribution (0,1) 
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The probability that a Normally distributed random variable X is less 
than or equal to b, (which is the Cumulative Density Function) is given 


by: 
oo 


ee 
OV 


Evaluation of this intergal can be circumvented by use of standard 


Fy(b) = P[x = b] = “k(x - M)°/of? dx. 


tables. The probability that X is less than or equal to b can be 
represented by the shaded area in Figure 24, and its magnitude is 


determined with the aid of tables in Appendix A. 


£2) 


Paecure 24; The Density Function of the Normal Distrisution 


Note that the Normal density function is dependent only on the two 
parameters y and ies If UU and ‘Of are both specified, probabilities 


can be obtained from the table in Appendix A. If 


2 
f(b) = ar eb /2 
then the random variable X has a Normal distribution with mean O and 
variance 1, and X is called a Standard Normal random variable. Most 
tables are written for the Standard Normal Distribution; therefore, to 
obtain probabilities for any other mean or variance values the ree 
conversions must be made. To compute the probability that X lies between 
a and b, convert a and b into standard units: 


a =e D/C and b' = (b- p)/O 


respectively. 


61 






Then 
Pia= X= bd) = P(a' = x* b') = area under the Standard Normal 
curve between a' and b', where x is the Standard Normal random variable. 
The following problems are used to familiarize the student with the 
use of the Standard Normal distribution N(0,1) and Appendix A: 
1. Let X be a random variable with the Standard Normal distribution 
N(O,1). 
Find: (1) EO Sx = 1.42) Civ) RCO ose= X= ll 26) 
(tii) ee GeO 7.33 ao Xo 0) (v) P(-1.79= X = -0.54) 
Cites) SP les 7 xX. = 2501) Gi) PO 25113) 
op ePtO = X = 1.42) is equal to the 
area under the Standard Normal curve 
between O and 1.42. Thus in Appendix 


A, page A-1, look down the second 


column until 1.42 is reached. The YA 
entry is .4222. Hence P(O = X = 1.42) O 1,42 
= 422. 


(ii) By symmetry, P(-0.73 = X= QO) 


P@mae 0073) = 22673 


~oif3 O 
Gem) P(=1.37 = xX = 2.01) = 
B@b.37 2 xX = 0) + S 


EOP X = 92.01) = 
.4147 + .4778 = .8925 
ei O26 Ol 
(iv) P(0.65 X X= 1.26) = 
PC Omme  X sael.26) < 
RAO X = 0M65) = 


~3926 - .2422 = .1540 O .65 1.26 
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Comet 9 X Sae0.54) = 
Beg 54 = XS 1279) = 


aoc =X. = 1279) = 


PiGasneexeca0.54) = -1.79 -.54 0 


4633 - .2054 = .2579 
(ajeee(X = 1.13) = 
mee ao) =—P(O Sex = 1213) = 
Omir 
mao = .3708 = .1292 


2. Let X be Normally distributed with mean 8 and standard deviation 


to) Find: 
Gor P(5S = X = 10) Citi P er C= 15) 
Gap r (10. x = 15) (iv) P(X = 5) 


(i) P(S = X= 10), must first convert to the standard normal 
distribution N(0,1) 
P(a = XX b) =P(a'= X*= pd’) 
@- WIC 
(b- wW/G 


Enekchoremnave PC=.75 sweX = .5) = P(-.75.4 XS 0) + POS XS 0.5) = 


qa! 


(5 - 8)/4 = -.75 


by (10 - 8)/4 = .5 


2734 + .1915 = .4649 
(ii) P(1O = X= 15) = P(.5 = X” = 1.75) = 
Paget = 1.75) - PCO - X= ..5) = 
4599 - .1915 = .2684 
(iii) P(X = 15) = P(X” = 1.75) = 
Pee 0) 700 = x= 1.75) = 
.5000 - .4599 = .0401 
(iv) P(X = 5) = P(X* = -.75) = 
Ger) = P(=.75 = X— 0) = 
.5000 - .2734 = .2266 
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3. Suppose the temperature T during June is Normally distributed 
with mean 68° and standard deviation 6°. Find the probability p that 


the temperature is between 70° and 80°, ~° 


il 


70° in standard units 


(70 - 68)/6 «33 


80° in standard units (80 - 68)/6 2.00 


Gren p= P(7O = T= 80) = P(0.33= T= 2) = 

Bae eT 2) = eee aet = 0.33) = 

moe = .1293 = 3479 

Here T* is the standardized random variable corresponding to T, so 
T* has the Standard Normal distribution N(COm1). 

4. Suppose that the lifetime of a certain type of electronic tube 
is a Normally distributed random variable. Suppose further that brand 
A has a lifetime X which is N(30 hours, 36 hours’), while brand B has a 
lifetime Y which is N(45 hours, 9 hours“). Of these two brands, which 
would you choose for use in an experimental aircraft with: 

(a) 34 hours mission time 
(b) 40 hours mission time 
(Hint: you should be able to solve these questions by simply sketching 


the two Normal distributions). 


BIVARIATE NORMAL DISTRIBUTION 

The Bivariate Normal and the Time-Varying Normal distributions are 
proposed inclusions to the computer program of the P-3 aircraft for 
tactical, on-station use of probability theory, and although a thorough 
development of the distributions is considered beyond the scope of this 
course, the following cursory description of each of these distributions 
should be sufficient for an intuitive idea of how these distributions 


are used. 
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The Bivariate Normal distribution is the joint probability distribu- 
tion of two Gaussian (or Normal) random saeuamiles. Basically, this 
distribution is the three-dimensional extension of the two-dimensional 
(the value and its frequency) Normal distribution. As seen in Figure 25, 
the Bivariate Normal distribution resembles a hat, and the specific 
Bivariate Normal distribution shown has a central value, which may or 


may not be located at the origin. 





HecUCCeaoee ss Siveriate Normal Dictricution 


The Bivariate Normal distribution is useful in Search Theory, and 
for two-dimensional area coverage. 

If the student is interested in further development of the Bivariate 
Normal, reference should be made to Selected Methods and Models in 


fageeedcy Operations Research, P. W. Zehna, et al, U. S. G. P.0., 1971, 
pages 49 - 6l. 


TIME-VARYING NORMAL DISTRIBUTION 

The Time-Varying Normal distribution describes a Normal random variable 
whose distribution varies as a function of time as well as of another 
parameter such as range. This distribution varies with time, and 
Figure 26 shows an example where the mean is zero for all values of time 


t, but the variance is increasing with increasing t. 
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ty ~-0 


preurme 25:5 Tire-Varyins Menzal Distribution 
The distribution shown in Figure 26 can be used to describe a submarine 
opening a reported datum, where a “datum outbound" situation is assumed 
to exist. As the figure illustrates, the probability that the submarine 
iomiueclose proximity to the reported datum--as reported at time t, or O-- 
decreases with increasing time. This is indicated by the spreading out 
LOE. 


i Z 


It is important to understand the development thus far presented to 


and flattening of the distribution from times t 


provide correct insight into what this distribution represents. Further, 

it is a definite advantage--and, in fact, a necessity for tactical, real- 
time usage--to have a digital computer to perform the mathematics associated 
with the Time-Varying Normal distribution resulting from an operator- 

input assessment of the tactical situation. The computer could then 

output the pertinent probabilities and options for operator use. Again, 

if the operator is to utilize the computer to employ the distribution, 

then it is imperative that he understand the tactical assumptions it is 


based on. 
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CHI-SQUARE DISTRIBUTION 


The eriesquare random variable is useful for ASW applications in that 
it can be used to describe any system that is the sum of the squares of 
Normal independent random variables. Probably the most important of 
these applications is that of describing radial distance, where the 
Gretatice is in terms of two or three directions, each of which has 
distance distributed Normally. Therefore, the medica distance to a 
specific point (from the origin) is the square root of the sum of the 
squares of the cartesian coordinates of the point (by the Pythagorean 
theorem). 

An example which displays the utility of the Chi-Square distribution 
is to obtain the probability that a submarine is within 5 miles of a 
central datum (the origin, for convenience), where it is known that the 
probabilities of the submarine being at a distance in miles along each of 
the x and y axis are each distributed N(0,100) due to error in the system 
that initially located the submarine. The submarine's position can be 
in error (or vary from the stated datum position) in both the x and y 
directions; therefore, its position may be specified by two random 
variables, X and Y. Although there are two random variables involved, 
the effect of using the Chi-Square distribution is a reduction of the 
problem to a single random variable R, which represents radial distance 


from the submarine to the central datum: 


R= Wx? +? 


The problem is to find the probability of the shaded region shown in 


Bacure 27. 
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Fisure 2/7: cum of Squares Lece Than or Equal to 25 Miles 


iheat is, to calculate P | «x? he y2)# ae 5| ; or equivalentLy, 

\e [ x2 +y2 = 25| , we apply the following principle from probability 
theory (refer to Central Limit Theorem on page 142 of Schaum's Outline 
Series, Theory and Problems of Statistics): The sum of n independent 
Normally distributed random variables forms a now random variable known 
as the Chi-Square random variable with 'n degree of freedom"; this is 


denoted by 


In the specific two-dimensional case treated here, if we put 


”) 
Re end i2-y. chen Ext = Xo. 
l 10 2 10 i=1_ 1 Z 


The probability is now denoted by 


p[x2+y¥2 = 25 | = P |102x? + 102x? = 25 | -P(xi+xd < 25 
1 9 1 2~= To 


p[ x] 4+ x? ha 0.25 | , or = B| x5 “= 0.25 | = i C0H25):: 
1 2 2 x? 
2 


The last term is the probability that the Chi-Square random variable 
with two degrees of freedom, is less than orequal to 0.25. This 


cumulative value is found from the "Percentage Points of the Chi-Square 
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Distribution" in Appendix A. This value, F . (0.25) is found by enter- 


Z 
ing the table with n (the left-hand column) value of 2 (the number of 


degrees of freedom-- )2/ is used for n in the table), and reading directly 
to the right until the column is reached that has the tabulated value 
closest to 0.25. That column is then followed vertically upward until 
the top row is reached--the row is labled as Q. The value in this row 
and column is the complement of the desired probability, and it is 

Q x3 (0.25) = 0.90 

1 -Q x? (0.25) = 1 - 0.90; therefore 


F 0.25) = 0.10. 


Now, consider the problem of finding the probability that a 
submarine will be "killed" by a moored mine if it is known that the kill 
capability of the mine is a function of the radial distance between the 
mine and the submarine. This is a three dimensional problem, as the 
submarine depth is also considered. The kill function is known to be 
certain if the submarine is within 30 yards of the mine, and the 
probabilities that the submarine will pass within a given range in each 
of the three coordinate directions (depth, 'North-South," and "East-West"') 
are independently distributed N(0,625) in vards. The problem is to 


calculate P | (x? +? + z*) = 900 | 


| SG] = peony pees 
Ox 25 

X = _Y = Y 

" Gx as 

GC L = & 
Oz 25 


(This technique is that used to normalize and standardize a Normal random 


variable), then 
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tt B4w 
~ 
a 
il 
x, 


if 


So that, 


ela ey? +22 = 30 | = P[ 252(x2 + x3 + x2) = 900] = 


p[x? + x2 + x? = oad SEN TE Se ee = 1.44 | = 
1 2 5 2 1 2 3 
25 
F 9 (1.44) 
X3 


From the table on page A-3, 


9 (1.44) 


Q = 0.75, and 
X3 
F y) (1.44) = 0.25 
3 


Hence, under the assumed probability distributions, the probability that 
the submarine will be "killed" is 0.25. 

There are other methods of solving this problem which involve working 
with the random variables X and Y; however, the Chi-Square method is 
easier, and it is convenient because the probabilities associated with 
it can be simply obtained from a precomputed table. 

The following problems will help familiarize the student with the 
use of the Chi-Square distribution and Appendix A: 

ie Find P [ SUS 21.5 | = F (21.5) 

20 Z 


20 
Answer: 0.75 


= Find P | = 14.7 | 
X50 
Answer: 0.25 | 
3. Find the value of X for which P exe x | = 0.75 


Answer: 29.3 
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4. An antiaircraft shell designed to burst at a specified point in 
Space has x, y, and z errors in a coordinate system with origin at the 
specified point which are independent Normally distributed random 
variables X, Y, Z, each with mean O and standard deviation 50, with 
length measured in feet. Find the probability that the radial error in 
the burst point is at most 100 feet. 


Answer: 0.75 


EXPONENTIAL DISTRIBUTION 


An exponential function is a function of the form £(t) = e*, where 


e is a constant which is very much like 77 , except that e is 
approximately 2.7183, vice 3.1416. For example, if t = 2, then 
£(2) = et = (2.7183)2 = 7.3891, as seen from the last entry opposite 
x = 2.00 in the table of Appendix A, page A-2. The specific exponential 
function of interest here is the negative exponential or f(t) = eel (see 
Appendix A, page A-2 for values of e *) which is the reciprocal of the 
positive exponential shown before, or f(t) = l/et. 

The Exponential distribution itself is actually ee) =e A os 
(t= 0) where A) , the lower case Greek letter "lambda,'' is a positive 
constant. It turns out that 1/) is the mean of this distribution. The 
graph of the Exponential distribution is shown in Figure 28. It is seen 


in the figure that the function (the Exponential distribution) "decays" 


with the progression of time. F 


. ae 






iat 2 f,(t)~ a density function 


—_—— = =p a 






t= 1/>r t 
Ficure 28: The Exponential CDistrisution 
File 





The Exponential distribution can be used to describe many phenomena 
connected with military applications. An Saepie where such a function 
occurs is the MITF (Mean Time To Failure) of an electronic equipment. 

The time axis represents the total lifetime of the equipment itself. 
The random variable X represents the total lifetime; that is, to say that 
X assumes a value ty means that the equipment fails at time t.. 
The cumulative distribution function is 
F(t)=1-e7A tt 2 0 
X 
Therefore, the table in Appendix A, page A-2, is still usable, except 
that the value found in the table must be subtracted from one. Figure 29 


shows the Cumulative Exponential probability distribution. 





Figure 29: The Cumulative Exponential Distribution runction 


Practical ASW applications of the Exponential probability distribution 
are predicting: (1) the mean time to trail a submarine and (2) the mean 
time to fail for ASW equipment. 

The following problems show some of these applications. 

1. Suppose that it is desired to trail a submarine that transits a 
barrier. Take the time that the submarine enters the barrier to be 
t = 0, and let PO be the probability that trail is initiated at t = QO, 
where P, is known. In this problem A is the reciprocal of the mean 


0 


time to lose trail (or the average rate of losing trail). For this 
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problem let 1/A__ be 20 days, and let 25 = (0.5. Whatwis the probabalacy 
the submarine is trailed at time t = 10 days? Since 1/A = 20, 
OR = 1/20 = 0.05; therefore, ‘ 

P(t) =P (1-e7 At) =0.5 (1 - SG. 2 

ORS l= 076). = (025)..(0.4) = 1062 

2. A device is to operate until failure. The pdf in a model describ- 
ing the time of failure is f(t) = 0 <= t, where t is in hours. What 
is the probability the device will fail in the first two hours? 

Answer: In order that e~© be of the form Ae A im A must equal 
one, that is, the cdf is F(t) = 1-e ©. Thus P | Fail Peis G2 hrs | = 
1 - e* = 1 - 0.135 = 0.865 

3. A resistor is installed in a fire control system. The pdf 
describing the time to failure of the resistor is f(t) = Ae A Co«< t 
where t is in vears. If the MTTF is 15 years what is the probability the 


resistor will last 6 years? 


a eal. = 0.07 
MTITF 15 


Ie [ Resistor last 6 years =l1-P | Resistor fails in 6 years | = 

1 - [1-A i t ]-=1- [1 - 0.07 e7 (0.0766) ] 

l - 1 - (0.67) (0.3) =1- 0.8 = 0.2 

4. The reliability of a device during a mission depends on T, the 
time the device has been in storage before the start of the operation. 
In a probability model, A is the event the device survives the mission, 
f(t) = e-© is the pdf of T. Find P(A) for a 6 month mission, where T is 
in years. 


F(t) = 1 - e~*, P(A) = 0.6 
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POISSON DISTRIBUTION 


Many physical phenomena can be described by the Poisson distribution. 
Bmdissdon of particles from a radiating source, arrivals of messages in a 
CIC, and submarines transiting a barrier are a few of the shipboard ASW 
Situations that may require this distribution for analysis of the 
problem. 

The basic notion of this distribution involves a stream of "customers" 
arriving at a "service point" at a constant average arrival rate. It is 
assumed also that arrivals are independent in the sense that the number 
of arrivals in any given time interval is independent of the number of 
arrivals in any other interval which does not overlap the first. 

The Poisson distribution is given by 


p(k, A ) = yk QA ky = 0.61. 2a 
ky 


where A = O is some constant. 

In the applications of the Poisson distribution that are of primary 
interest in ASW, the Poisson distribution represents the number of 
"arrivals" in the given time interval. For example, if time is 
measured in hours, the given time interval is a certain ten-hour period, 


and A= 0.5, then the notation 


9(2,0.5) = (0.5)* e795 = 0.076 
fas 


is interpreted to mean: 

"the probability of exactly two arrivals in the given time period is 
0.076." 

The length of the time interval is not apparent in the expression 


defining p(k, A). Suppose the basic unit of time (this may be a second, 
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a minute, an hour, a aay a year, etc.) has been chosen. Let T be the 
length of the given time interval, i.e. the number of basic time units. 
Then the A in the Poisson distribution has the property that A/T is 
the mean arrival rate, expressed as the number of arrivals per unit of 
time. That is, if we denote the mean arrival rate by qd, then 

a= A/T, or A= QT. 
Therefore, for applications other than the Binomial approximation 


indicated next, we use 


- aT k 
p(k, qty =2 (a7, k= 50 ae 


to express the probabilities for the various possible number of arrivals. 
This distribution has a mean, P = A _ , variance, oka = Xr and 
standard deviation, Che Ve 
In addition to having its own applications, the Poisson distribution 
also provides us with a close approximation of the Binomial distribution 
for small k, provided that p is small and A=np. This is indicated 


in Table 7 where n = 100, p = 1/100 and A = np = 1. 


Preete. 0. ores 
Poiscseon ie S ChylGies iF 


fople (: Comparicon of sinomial and Foisson Values 





The Poisson distribution can probably best be understood through 


examples and problems. 
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rnd (i) e >. Gay 6 2-2 


From Appendix A, page A-2, 
(Cee = 0.273 


(ii) e72-5 = 0.0821 


2. For the Poisson distribution p(k, A ) = Ak en » find 
: k! 
Cr) oa); (ii) pQ3,4), Cie 2. 
(i) p(2,1) = L2e7t = eo"! = 0.368 = 9.184 
Oa 2 2 


(ii) p(3,%) = 0.013 
Cia) pC 2427) = 0.12 
3. Suppose that in a particular region submarine detection times 
obey a Poisson distribution, with a mean detection rate of one submarine 
every 12 days. Calculate the probability that five submarines will be 
detected in the operating area during a quarter (120 days)? 
As indicated above, A = @ T, where Q represents the average 
detection rate (i.e. the average number of detections per day, which is 


1/12), and T the length of the given time interval, so that T = 120). 


Therefore, 
P [ K = 5 | eras oe ann Kk ee - (1/12) (120) (1/12) (120)? 
k! Se 
0.03783 


4. Using the information from problem 3, find the probability of 
detecting five or fewer (at most five) submarines during the quarter. 
1G | & .-(/12) (220) 
P [ = 5 | = Cl 12120) e = 0.6709 
k=0 k! 
5. Approximate the Binomial distribution problem number 4 in the 
Binomial distribution section of this chapter, using the Poisson 


distribution. Answer: 0.9 
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APPENDIX A 


o 


Areas Under the Standard Normal Curve from 0 to 


x 


0.00 


0733 


0.50 


0.54 


0.65 


O273 


1 he: 


P(x) 

.0000 
~ 1293 
1olS 
aoe 
eee 


12673 


- 3708 


x 


7.20 


1.37 


1.42 


| Oe 





P(x) 
O39 02 
0.4147 
0.4222 
0.4599 
0.4633 
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